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Abstract 

It was shown in [3] that the least concave majorant of one-sided Brownian motion without 
drift can be characterized by a jump process with independent increments, which is the inverse 
of the process of slopes of the least concave majorant. This result can be used to prove the result 
in |20j that the number of vertices of the smallest concave majorant of the empirical distribution 
function of a sample of size n from the uniform distribution on [0, 1] is asymptotically normal, 
with an asymptotic expectation and variance which are both of order log n. 

A similar (Markovian) inverse jump process was introduced in [5J, in an analysis of the least 
concave majorant of two-sided Brownian motion with a parabolic drift. This process is quite 
different from the process for one-sided Brownian motion without drift: the number of vertices 
in a (corresponding slopes) interval has an expectation proportional to the length of the interval 
and the variance of the number of vertices in such an interval is about half the size of the 
expectation, if the length of the interval tends to infinity. We prove an asymptotic normality 
result for the number of vertices in an increasing interval, which translates into a corresponding 
result for the least concave majorant of an empirical distribution function of a sample of size n, 
generated by a strictly concave distribution function. In this case the number of vertices is of 
order cube root n and the variance is again about half the size of the asymptotic expectation. 

As a side result we obtain some interesting relations between the first moments of the number 
of vertices, the square of the location of the maximum of Brownian motion minus a parabola, 
the value of the maximum itself, the squared slope of the least concave majorant at zero, and 
the value of the least concave majorant at zero. 
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1 Introduction 



It was shown in [3j that one-sided Brownian motion can be generated by a jump process with 
independent increments (which is the inverse of the process of slopes of the least concave majorant) 
together with Brownian excursions between successive vertices of the least concave majorant. This 
decomposition of Brownian motion, using the inverse process, was also analyzed in [T7], where 
certain path decomposition results, introduced by David Williams, were applied. 

The study of the (least) concave majorant of Brownian motion in [3] was actually motivated 
by the wish to give an alternative derivation of the asymptotic distribution for certain statistics, 
studied in pQ and |19j . and first proved in |10j . where the asymptotic distribution was found by 
analyzing the spacings, induced by the least concave majorant of the empirical distribution function. 

As a side effect, j3] also threw some new light on a result of [20], which is stated below. A 
straightforward proof of this result, using characteristic functions and the Poisson representation 
in [10], is given in [9]. 



Theorem 1.1 [Sparre Andersen (1954)] Let N n be the number of vertices of the least concave 
majorant of the empirical distribution function of a sample of size n from the uniform distribution 
on [0, 1]. Then 

N n -lo g n v^ mi)} 



n 



where — > denotes convergence in distribution, and N(0, 1) is the standard normal distribution. 

The corresponding result for the Brownian bridge on [0, 1], which follows from [3], is: 

Theorem 1.2 Let C be the least concave majorant of the Brownian bridge on [0,1]. Then the 
number of vertices N n of C on the interval [1/n, 1 — 1/n] satisfies: 

N n -\o g n v^ mi) 



v^og 



n 



If one studies more closely "where the action is", in the sense that the number of vertices 
increases to infinity, it turns out that all the action is near and 1: on an interval [e, 1 — e], where 
e > 0, there will, with probability one, only be finitely many vertices. 

The situation is strikingly different for the least concave majorant of two-sided Brownian motion 
minus a parabola. Here the action is "the same everywhere" , and the point process of locations of 
vertices is stationary. The real purpose of the papers [1] and [6] was to analyze the (stationary) 
process 

{V{a) - a : a € R} , (1.1) 

where 

V(a) = argmax teM {W(t) - (t - a) 2 } , a £ R, 

and W is standard two-sided Brownian motion, originating from zero. The process V itself is a 
pure jump process, which runs through the locations of the vertices of the least concave majorant. 
At points where V{a) is not uniquely determined (which happens if a is a slope of the least concave 
majorant), we take the largest value t at which W(t) — (t — a) 2 is maximal. In this way the process 
V becomes right-continuous. 



The infinitesimal generator of the process (1.1) is given in Theorem 4.1 of [6], where it is 



expressed in terms of Airy functions. However, most attention has been for the result on the 
distribution of ^(0), which gave an analytic expression for the limit distribution of a whole class of 
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Figure 1: The least concave major ant of W(t) — t 2 . 



so-called "isotonic estimators" , for example the pointwise limit distribution of an estimator of the 
mode, discussed in [2], and the pointwise limit behavior of the Grenander (maximum likelihood) 
estimator of a decreasing density, see, e.g., [18] and [5J. 

For results on global functionals, however, like the L\ or L2 distance of the Grenander estimator 
to the underlying density, or the number of its jumps, one needs information on the whole process V, 
and not only on its pointwise behavior. In this paper we will show how one can extract information 
from Theorem 4.1 in [6] in the derivation of a central limit theorem for the number of points jump 
of V in an increasing interval. The result has a rather large number of applications in statistics, but 
we will only sketch one such result for the number of points of jump of the Grenander estimator 
(which is equivalent to the corresponding result for the number of vertices of the least concave 
majorant). 

Our main result is the following central limit theorem, which is proved at the end of section [2j 
Theorem 1.3 Let N[a,b] be the number of jumps of the process V in the interval [a,b]. Then 



where k\ ~ 2.10848 and ki ~ 1.029, and N(0, 1) is the standard normal distribution. 

Perhaps somewhat remarkably, the difference between the results for the least concave majorants 
of one-sided Brownian motion without drift and two-sided Brownian motion with a parabolic drift 
has its counterpart in the difference between the convex hulls of uniform samples of points from 
the interior a convex polygon and from the interior of a convex figure with a smooth boundary, see 
[5]. In this case one also meets the rates log re and n 1 / 3 for the number of vertices of the convex 
hulls of the samples, with corresponding central limit results. 
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2 The number of jumps of the process V in an increasing interval 

Although [6 J has the simpler conceptual characterization, the characterization in [1] might be more 
useful for numerical computations. It was also used in where Chernoff's density and its 
moments were computed. Chernoff's density is the density of V(0), which often occurs as limit of 
isotonic estimators and in particular in the limit distribution of the Grenander estimator. Here, 
however, we take [B] as our starting point. 

The process (1.1) is completely characterized by Theorem 4.1 of [6]. As a corollary we have the 
following result for the jump measure. 

Theorem 2.1 We have, if y > x, 

lim h~ l F {V(a + h)ea + dy\V(a) = a + x} = 2 ^ ~ ~ g) d 

hio g[x) 

where g has Fourier transform 

g(u) = J e-g(x) dx = - , u G R, (2.2) 

and 

p(u) = p (u) + (2vni 3 )~ 1/2 ,u>0, 

where po has Laplace transform 

, s 2 2 /s A i'(2- 1 /s„) 
W( " )= Ai(2-^) +2 ^ 

Remark 2.1 Note that we define the Fourier transform in the "probabilistic way" , in analogy with 
the definition of the characteristic function of a probability distribution. 

A picture of the function g, using the representation 

2 roc g—iux 

9{x) = ¥JhJ^ Ai(i2-1/3 U ) 

which follows from (2.2), is shown in Figure [2j 
The function p has the representation 



oo 

p(u) = 2Y j e 2l/3dnU , u > 0, (2.3) 



n=l 



where the d n are the zeros of the Airy function Ai on the negative halfline, see (4.12) in [6j. This 
expansion is divergent at zero, however, where we have: 

p(u) ~ (2iru 3 y 1/2 , n|0, (2.4) 

see part (ii) of Lemma 4.2 in [6]. This is the reason for considering the regularization 

P0 (U) = P (U) - (2TTU 3 )- 1 / 2 , 



and for o nly u sing the re pres entation (2.3) for u > 1. If u < 1 we use the representation given 



below in (2.11) of Lemma 2.2 The function u i— > u 3 ^ 2 p(u) is shown in Figure [3] 

For later purposes we summarize in the following lemma some properties of functions p and g 
and the random variable V(0). This also gives still another regularization of the function p. 
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Figure 2: The function g. 
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Figure 3: The function u h-> u 3 ^ 2 p(u), u > 0. 



Lemma 2.1 Lei the function p be defined as in Theorem 2.1, Then: 
(i) The function p\ , defined by 



pi 0*0 



xp(x) , x > 0, 







x < 0, 



/ias Fourier transform 



(ii) The function h, defined by 



2 V3 Ai // 2-l/3 in ) 2 



Pi(u) = iu + 

Ai(-2-i/3^) 2 



h(x) = / g(x + u)up(u) du, i£R, 



/ias Fourier transform 



h(u) 



2 l 'Hu 2 2 ^A\'{2- 1 lHuY 



Ai(2- 



1/3 



+ 



Ai(2 



(Hi) The random variable V(0) has characteristic function 



Ee itV(0) = 2 2/3 f°° *f t G R / 2 g ^ 

2^ 7 U= _ 00 Ai(«*)Ai(i(t + u))' 



(w,) T/ie random variable V(0) /ias expectation zero and second moment 

2-2/3 /-co 



^y(O) 2 = / =• du « 0.26355964. (2.10) 

6vr y^-oo Ai frar 



Proof. Part (i): this immediately follows from Theorem 2.1, noting that the function x h-> 1 /V 2ttx 



has Laplace transform \j\J2u and by switching from Laplace transform to Fourier transform. 



Part (ii): this follows from (i) and Theorem 2.1, by noting that the convolution turns into the 



product of the Fourier transforms (with an added change of sign for the Fourier transform of p\). 
Part (iii): this is the Fourier transform of the function 

x i y ^g(—x)g(x), 

which turns into the convolution of the Fourier transform of x i— > g{—x) and Fourier transform of 

x i — y g(x) times (4vr) — 1 . 

Part (iv) follows from the formula 



n-2/3 j2 r oo j 

EV(0) 2 = - ' 



2ir dt 2 i„ = _ oc Ai (iu) M(i(t + u)) 



2 -2/3 ,oo r iu 2Ai' (iu) 2 



, n 27r A=-oo lAi(m) 2 Ai(m) 4 



du 



2-2/3 rco r iu 2ra 1 2" 2 / 3 r°° iu 

~iri^- B0 lAn>p " 3Ai(zu) 2 j U -~~toTj u= _ 00 Al(iuf U ' 

where we use integration by parts and the Airy equation for the term involving Ai'. □ 



Remark 2.2 The function p does not have the same meaning in [4J. If we denote the function p 
of [1] by p, we have: 

p (t) = e -^L{p(t)+t- 3 / 2 },t>o. 



/2tt 

The function p has an expansion which converges at zero 



Lemma 2.2 The function p can be written 

a t 3 /6 

where 



Pit) = ^{pit )+ t^}, 



= ~\h E flfct3fc + E M 3(fc - 1/2) . * > o, (2.11) 



2 

fc=0 fc=l 



and the coefficients at and bk are recursively defined as follows. Set cq = 1 and 

_ 4 (2n-3)(2» + l) 
n z (2n — 1 
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Then with oq = 1, 61 = 2/3 ; and B(p,q) = T(p)T(q)/T(p + q), the standard Beta function, set 



'"n — >-n 



n— 1 ^ 

vrfc!f-2^ bn - fc£?(3n " 2/c " 1/2, * + 3/2) ' n = h 2 ' ' ' ' 



fc=0 



n— 1 - 

6 ™ = Z fc! (_ 2 )fc+i a "- fc - l£?(3n " 2/c " 2 ' k + 3/2) ' n = 2 ' 3 ' 



fc=0 



Proof. This follows from Remark 2.2 and Theorem 4.2 in [2]. 



Let u 2 '■ 



be defined by 



u 2 {x) 



2x 



1 



'2tT Jo 



P(y) exp(-ly(2x + y) 2 )dy 



if x G [— 1, 00), and 



u 2 (x) = exp (fx 3 ) 4 1 / 3 ^exp(-2 1 / 3 a fe x)/Ai / (a fc ) 



k=l 



if x G (—00, —1]; here Ai' is the derivative of the Airy function Ai. 
The notation u 2 is used because u 2 has the interpretation 

d 

u 2 (t) = lim —u(t,x), 

x^t 2 OX 



where u(t, x) is the solution of the heat equation 





dt 



u(t, x) 



dx 2 



2 :>^? U (t' X )l 



for x < t 2 , under the boundary conditions 



(2.12) 
(2.13) 



□ 



1 f°° 

+ — = / {4x 2 + 8xy + 3y 2 }exp(-±y(2x + y) 2 )y- 1 / 2 dy (2.14) 

V27T Jo 



(2.15) 



u(t,t 2 ) d = hmu(t,x) = 1, lim u(t,i) = 0, tel. 

xfi 2 xj,— 00 

This function occurred in the paper [2], where the density of the location of the maximum of 
Brownian motion minus a parabola was first characterized. 

The following result summarizes the correspondence between the results in [3] and [6]. 

Theorem 2.2 (i) We have, ify>x, 

limh- 1 F{V(a + h) G a + dy\V(a) = a + x} 

hlO 

2{y - x)u 2 {y)e~l {y - x){x+y)2 {p{y - x) + (y - x)' 3 / 2 } 



u 2 (x)\/2tt 



dy, 



where u 2 is defined by (2. 14) and (2.15) and p by (2.11). 
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(ii) 

U2 (x) = e3 x3 g(x), i£R and p{t) = V2^e~ t3/6 p(t) - t" 3/2 , t > 0, 
where the functions g and p are defined as in Theorem \2.1\ 
(Hi) The density ofV{0) is given by 

fv(0){x) = \u 2 {x)u 2 {-x) = \g{x)g{-x), x € R, 



where g is defined as in Theorem 2.1 



Remark 2.3 In [TTj part (Hi) of Theorem 2.2 was used in the computation of the Chernoff dis- 



tribution. The function u 2 corresponds to the function k\ in [6] and part (i) of of Theorem 2.2 



corresponds to the first version of the infinitesimal generator of the process V(a) — a, given in 
Theorem 4.1 of that paper. 

It is seen from Theorem 2.2 and (2.14) that the function p (or alternatively, the regularization 
p) is the fundamental function; both the jump measure and the density of V(0) are expressed in 
terms of p. 

We are now ready to compute the expectation of the number of jumps of the process V in an 
interval (of slopes) [a, b], where we use similar techniques as in [5], which dealt with convex hulls 
of samples of points from the interior of a convex set in the plane. 

Let the function cf) be defined by 



<j)(x) 



g(x + u)up(u) 



du, x G 



(2.16) 



(x) is the integrated jump measure, starting from position x. Moreover, let N[a, b] denote the 



number of jumps of the process V in the interval [a, b\. Then Theorem 2.1 tells us that 



N\a, b] 



(j)(V{c) — c) dc, b > a, 



(2.17) 



is a martingale w.r.t. the filtration, generated by V(b), b > a. As a consequence, we have the 
following result. 

Lemma 2.3 Let N[a,b] be the number of jumps of the process V in the interval [a,b]. Then 

EN[a,b] = ki(b-a), 

where 



g{-x) dx / g(y)(y - x)p(y -x)dy « 2.10848. 

i J y=x 



(2.18) 



Proof. We get from Theorem 2.1 



EN[a,, 



E(j){V{c) - c) dc. 



8 



Using the stationarity of the process c h- > V(c) — c we get: 



E<t>{V(c) -c)dc = J E(f>(V(0)) dc 

Mrw*r i,Mt :f-" * 



-oo 
oo 



(6 -a)/ 



-oo 
oo 



3(2;) 



(b-a) g{-x)dx g(y)(y - x)p(y - x) dy, 



and the result follows. The constant k\ was determined numerically by using Theorem 2.2 



□ 



Remark 2.4 As one of the referees remarks, Fourier analysis, applied on the right-hand side of 



(2.18), gives: 



fel 



2 5/3 
6-7T 



IU 



Aif2-V3 



2 7/3 
6"7T 



Ai(i«) 4 



du 



8EV(0) 2 = IE max {W(t)-t 2 ), 



(2.19) 



where W is standard two-sided Brownian motion, originating from zero, and V(0) is defined as in 



(1.1), for a = 0. This follows from Lemma 2.1, since we get by Parseval's formula: 



g(-x)dx g(y)(y - x)p(y - x)dy 

Jy=x 



1 

2TT 



1 

2^ 



2Ai / f2^ 1 /3 



*2 2 / 3 



u 



Aif2-V3 



in 



Ai(2- 1 /3j„)' 



(in 



g(—u)h(u) du 

25/3 /-oo 
67T 



Aif2-V3 



dn, 



IU) 



where we use integration by parts and the Airy equation for the term involving Ai' as in the proof 



of part (iv) of Lemma 2.1 



This also gives an interesting relation between the moments of the location of the maximum 
and moments of the maximum itself. By [13] we get: 

£max (W(t) - t 2 ) = 2~ 1/3 EM = 0.790679, 



where M is the maximum of W(t) - p 2 , see (1.7) and (2.5) of their paper, which is in accordance 
with the value, given in Lemma 2.3 The integral representation for the maximum of W(t) — t 2 of 



type (2.19) above corresponds to (2.1) in their paper (after replacing W(t) — t 2 by W(t) — it 2 , see 



also Remark 2.5 below). The value EV(0) was computed in [llj, where it is given by 0.26355964 



(note that this is also given in part (iv) of Lemma 2.1), and this gives k\ = 2.10848 again. For 
convenience, we state this in a separate lemma. 



Lemma 2.4 The constant k\ in Lemma 2.3 has the representation 

ki = 8EV{0) 2 = §£max (W{t) - t 2 ) w 2.10848. 



(2.20) 
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Remark 2.5 Note that 

argmax t6M {W(t) - ct 2 } = argmax, eK (V 1 / 3 S^W(c 2 / 3 t) - (c 2 'h) 

= argmax teR <jV(c 2/3 i) - (c 2/3 i) 2 | = cT 2/3 argmax u6R {W{u) - u 2 } , 

which implies that, if we define fci(l) = k\ and ^(1) = &2j and denote the corresponding constants 
for the process t \-t W(t) — ct 2 by fci(c) and /02(c): 

h(c) = c 2/3 k u i = 1,2. 



Relation (2.19) changes into: 

8c 

jfci ( c ) = 8c 2 £V c (0) 2 = — E max (W(*) - ct 2 ) , (2.21) 

3 tew 

where a 1— > V c (a) is the process of locations of maxima of 1 1— )■ VF(i) — c(t — a) 2 . 

Remark 2.6 As also pointed out by one of the referees, |15j represent the constant fci(l/2), where 
&x(l/2) is defined as in Remark 2.5 in their Corollary 4 as the sum of two constants: 

k 1 {l/2) = EX(0)+EX'(0) 2 , (2.22) 

where X(t) = W(t) — \l? and X is the greatest concave majorant of X, with slope -^'(O) at zero. 
(I switch from the convex minorants of Brownian motion plus a parabola to the concave majorants 
of Brownian motion minus this parabola here; this gives the same k\). [12] give simulation results 
for X(t) = W(t) - U 2 , which would imply that h(l/2) w 1.289. We get from Remark 



2.5 



fci(l/2) = 2" 2/3 A;i(l) rj 1.32826, 

which is larger than the value arising out of the simulations in [T3], indicating that it is very hard 
to obtain precise values of these constants by direct simulation of Brownian motion. 

Denoting the least concave majorant of the process 1 1— > W(t) — ct 2 by X c , and using a notation 
similar to the notation of Remark |2,5[ we would get the relation 



fei(c) = 2cEX c (0) + EX' c (0) 2 , (2.23) 
which indeed is compatible with the relation: 

fc 1 (c) = c 2 / 3 A: 1 (l). 

Remar k 2. 7 There exists a simple relation betw een V c (0) and X c (0), where V c is defined as in 



Remark 



2.5 



2.6 



This follows from the so-called "switch relation" : 



and X' c (0) is defined as in Remark 

X' c (0) < 2ca V c {a) > 0. 

Since 

> 0} =F{V c (a) - a > -a} =F{V c {0) > -a} = P{2cV c (0) < 2ca} , 
we get that X' c (0) and 2cV c (0) have the same distribution, and hence: 

EX' c (0) 2 = 4c 2 W c (0) 2 . 
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Combining the preceding remarks, and in particular assuming that (2.23) (or (2.22)) holds, we 
obtain: 

fei(c) = 2cEX c (0) + Ac 2 EV c (0) 2 = 8c 2 EV c {0) 2 , 

which implies 

EX c {0) = 2cEV c {0) 2 = |Emax{W(t) - ct 2 } . 

The asymptotic behavior of the functions p and <j) is given in the following lemma. 
Lemma 2.5 We have: 
(i) 

p{t) ~ 2e 2l/3Slt , t -> oo, 
where a\ is the first zero of the Airy function Ai on the negative halfline. 

(ii) 

pit) ~ (27Tt 3 )- 1/2 , HO. 

</>(*) ~ 2t 2 , f -)• -oo, 0(t) ~ i ->■ oo. 



Proof, (i) follows from (2.3) and (ii) is the same as (2.4), which follows from (4.17) of Theorem 
4.1 in [6j. 

(iii). The function g is denoted by g\ in [6], and hence, according to part (i) of Corollary 3.4, [6]: 

~ -2V3a n |t| 

where the a n are the zeros of the Airy function on the negative halfline. 

We now have, using part (i) of Corollary 3.4 and part (ii) of Lemma 4.2, [6j, if t < 0, 



1*1 2up(u)g(t + u) d ^ e^j+j/Ar^ ~ 

" E~ie 2l/3a "l*l/Ai'(d n ) ^ 

4 f'S^I^^ 

/ E oo =i e 2V3 {Sl - 5n} t /Ai / (5n) A, 

|t| °° „2 1 /3{ 5l _a n }(t +u ) 







OC 



o i Ai'(a n ) 

n=l v ' n=l 



1 /3{a 1 -a„}« I du 



~4 /'"nJl + y e -^«-M4 dn + 4 /' l \y e 2V3 {ai _, n}(i+M )Ai> 1 ) ^ 
Jo \ t* 2 I 7o Ai'(5 n ) 

~ 4 / udu = 2t , t ->■ -oo. 



o 1 „_o I JO 

1*1 



n=2 J " u n=2 

2 
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Furthermore, 



00 2up(u)g(t + u) ^ = r 2(\t\+u)p(\t\+u)g(u) ^ 



g(t) Jo g(t) 

POO 00 

~ 4 2 / 3 Ai'(a 1 ) / e - 2l/3 ^l*l V e 2l/3 MI*l+«} {\ t \ + u} g(u) du = 0(\t\), t -> -00, 
J n=l 

and the first part of (iii) now follows. 

Using Laplace's method and part (ii) of Corollary 3.4, [6], we find: 

2up{u)g(t + u) du ^ 2 r°° up(u)(t + u)exp{-l(t + u) 3 } ^ 



g(t) ' Jo texp{-|t 3 } 

- x l\t + u) exp {-2t 2 u - 2tu 2 - fu 3 } 

— - du 

2vr Jo t 

2 f°° u~ l / 2 (t + u)exp{-2t 2 n} 1 

' du rsj — . t — y 00. 



2vr Jo t t 

□ 



150 - 



100 - 




-10 
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Figure 4: The function 4>. 

For the variance of N[a, b] we get the following result. 
Theorem 2.3 Let N[a,b] be the number of jumps of the process V in the interval [a, b\. Then: 

vax(N[a, b]) = EN [a, b] + 2 covar (4>{-V (a) + c x ) , (j) (V(c 2 ) - c 2 )) dc x dc 2 . (2.24) 

J a<ci<C2<b 

12 



Proof. We have: 

EN(a, b + h) 2 - EN [a, b} 2 = EN(b, b + h) 2 + 2EN(b, b + h)N[a, b] 
~ EN(b, b + h) + 2EN(b, b + h)N[a, b] 
~ hE<j){V{b)) + 2hEN[a, b]4>{V{b) -b),hi 0, 

and hence: 

f b 

EN [a, b] 2 = EN[a, b]+2E j N[a, c]4>{V{c) - c) dc. 

J a 

Moreover, using an obvious time reversal argument, also used in [5], we get: 

E f N[a,c]<j)(V(c)-c)dc = E [ {! 4>{-V( Cl ) + c x ) dc x \ 4>{V(c) - c) dc 



Note: 



a W a 



E {</>(-V(d) + ci)0(y(c 2 ) - c 2 )} d Cl dc 2 . 

a<ci<C2<b 



E {4>{-V{ Cl ) + c x )4>{V{c 2 ) - c 2 )} d Cl dc 2 

a<ci<C2<b 

cov (0(— V (ci) + ci) , (V(c2) - c 2 )) dc\dc 2 

a<ci<C2<b 



f 

J a 

f 

■J a 



+ [ E0(-V(c 1 ) + c 1 )E<f>(V(c 2 )-c 2 )dc 1 dc2 

J a<ci <co<6 



<ci<C2<6 



' a<ci<C2<b 

cov (^(— y (ci) + ci) , (V(c 2 ) - c 2 )) dc\dc 2 



+ (£0(V(O))r / dcidc 2 

./ a<ci<C2<6 

cov ((f)(-V (ci) + ci) , (F(c 2 ) - c 2 )) dddc 2 + §A;f (6 - a) 5 

<c±<C2<b 



so we get: 

var(7V[a, b}) = EN [a, b] + 2 f cov (<j>(-V (ci) + ci) , (j) (V(c 2 ) - c 2 )) cfcicfc 2 . 

J a< 



i<c\<C2<b 



□ 



We prove in the sequel that the dependence between (p(—V(ci) + c\) and <j)(V(c 2 ) — c 2 ) dies 



out exponentially fast, as c 2 — c\ — > oo, which, together with part (iii) of Lemma 2.5 gives that 
the covariance of 4>(— V (c\) + c\) and 4> (V(c 2 ) — c 2 ) dies out exponentially fast, as c 2 — c\ — > oo. 
Hence we get: 

var(iV[a, b]) ~ k 2 (b — a), b — a — > oo, 

for a constant fe 2 > 0. 

There are several ways in which one could try to determine the constant k 2 . One possible ap- 
proach is to use an integro-differential equation to determine the constant k 2 , following a suggestion 
on p. 546 of [I]. Let the function k(a,t) be defined by 

k(a, t) = E (0(V(O)) | V(a) =t},a<0. 
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Then we have, for a < 0, 



k(a,t) = E{<j>(V(0)) | V(a) = t} 

= E {E {^(V(0)) \V(a + h)}\ V{a) = t] 

h r U + ^ , 2up(u)g(t-a + u) 

= h k{a,t + u) r du 

Ju=o 9{t - a) 

, J, , f°° 2up(u)g(t-a + u) ,] 

+ k(a + h, t ){ 1 - h / ^ ! , r du \ + o(h) 

{ Ju=o 9(t-a) J 

= k(a + h,t) + h r {k{a, t + u)- k(a, t)} 2up ^^ ~ a + n ) du + ,ja h | _ 
Ju=o 9{t-a) 



Hence we get: 



— k(a, t) = - {k(a, t + u)- k(a, t)} y 1 , r du, 

da J u=0 git - a) 



which leads to the integral equation 

W {fc ( M + u )_ fc ( M ) } *^ 
iu=o 9{t - b) 



k(a, t) = k(0, t)+ / ° db f°° {k(b, t + u)- k(b, t ^ 2u P( u )9(t b + u ) du ^ a < 



Note that 

E {<f>(V(0)) | V{a) -a = t} = k(a, t + a). 

Using this approach, the constant &2 was approximated numerically, on a grid with stepsize 1CP 3 
in both coordinates on the interval [—10, 10], using the boundary condition 

k(0,t) = 4>{t), t £ R, 

and replacing integrals by Riemann sums. In this way we obtained: 

POO 

2 / cov(^(V(0)), <f>(-V(a) + a))da « -1.11891, 
Jo 

which would give: ki ~ 0.986. However, since the numerical computations seemed somewhat 
unstable, we have more trust in the value obtained by simulating the vertex process directly, 
without first generating Brownian motion, in the way described below. 

One could also try to determine an approximate value of the constant &2 by simulating Brownian 
motion directly. However, since one needs very long intervals (or, alternatively, a rescaling which 
also leads to very computation-intensive simulation), it is doubtful that we get a good approximation 
in this way. See also the discussion in Remark 2.6 on the constant k\(l/2), obtained by simulating 
Brownian motion directly in [15J, which gave the value 1.289, while the analytically determined 
value is 1.32826. 

We can use Theorem 2.1 to generate the process {V(a) : a G M} without first generating 
Brownian motion. This method of generating the vertices was also used in [16] and [7], for generating 
the vertices of convex hulls of Poisson processes of points in the plane, and seemed to work rather 
well in that situation. 

We start the process at time zero, by generating V(0) according to the "Chernoffian" distribution 
fv(o), given by: 

fv(o){x) = \g{-x)g{x), xeR, 
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where g is defined as in Theorem 2.1 Suppose this gives V(0) = x. Next we generate the waiting 
time until a jump according to the distribution function 

Fx{o) = 1 — exp < — / 4>(x — b)db> = 1 — exp < — / cj>{u) du > , a > 0. 

I Jb=0 J I Ju=x-a J 

where 4>(u) is the integrated jump measure, starting from position u. Suppose that this gives the 
jump time a > 0. Then we generate a jump according to the jump density 

2g(x - a + u)up{u) 

u H- — — — , u > 0. 

g{x — a)(p{x — a) 

This gives a new position y from which we generate a waiting time, according to the distribution 
function 

f f y ~ a 1 
Fy- a (b — a) = 1 — exp < — / </>(?/) ciu > , b — a > 0, 

I J u=y—b ) 

which gives a new jump time b > a, from which we generate a jump according to the jump density 



u ^ 2g(^ - b + tt) M p(u) > ^ 
g(y - b)4>{y - b) 



and so on. Defining 



we can write: 



y-a\ 



lu=0 ^( n ) du ' X - °> 

- J u ° =x , x < 0, 

1 - exp {- {$(y - a) - $(y - 5)}} . 



We indeed used this procedure to generate the process V. Instead of the jumps lengths them- 
selves we generated the square roots of the jump lengths, which have a bounded density, in contrast 
with the jump lengths, which have a density which is unbounded near zero. This enabled us to 
generate the square roots of the jump lengths by rejection sampling, since we can compute the 
density from the theory above (but note that this density depends on the value of x — a, so we get 
a family of densities, parametrized by x - a). 

The waiting times between jumps can be generated using the following observations. Note that, 
for a uniform random variable U: 

F x {u) = F{U < F x (u)} = P{- log{l -U}<- log{l - F x (u)}} 
= P{-log{l-C/} < $(3) - ®{x - u)} = F{<Z>~ 1 (W) <u}, 

where W is standard exponentially distributed and is the inverse of the function 

u h-> <3?(x) - <&(x - u), u > 0. 

Hence the waiting times between jumps can be generated by generating the random variables 
&~?_ a (W), where W has a standard exponential distribution; $^_ a was computed on a equidistant 
grid, with distance 10 -3 between successive points of the grid, and with linear interpolation between 
points of the grid. In this way we found in 10, 000 simulations, where a ran through the interval 
[0, 10 4 ]: ki « 2.1082 (note that this is very close to the analytically determined value 2.10484) and 
k 2 « 1.029. 

The alternative characterization of the jump process, used in the simulations, is given in the 
following theorem. 
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Theorem 2.4 The process {V(a) : a £ M} is a Markovian pure jump process, where the jump 
density at time a is given by 

u -> ^ " , ti > 0. (2.25) 

g(x — a)(p{x — a) 

given V(a— ) = x, and where the distribution function of the waiting time till the next jump is given 
by 

F x „ a (b-a) = 1-expi- / du i , 6 - a > 0, (2.26) 

I Ju=x—b J 

given V(a) = x. 



Remark 2.8 By part (iii) of Lemma 2.5, we have: 



4>{—u) ~ 2u 2 , u — > oo. 

This yields, for fixed x, a £ M, 

c— a 

du ~ |(6 - x) 3 ~ |6 3 , b -> oo, 



implying 

l,,„ J 1 _ /■' ;/, _ „u - _ 
This is in accordance with: 



log{l-F x _ a (6-a)} ~ -U 3 , b^oo. 



log(l-P{|V(a) -a\> t}) ft 3 , t -»• oo 

see Corollary 3.4, part (iii), in [6]. 

We summarize our findings on the variance in the following lemma. 
Lemma 2.6 Let N[a,b] be the number of jumps of the process V in the interval [a,b]. Then 

var(A r [a, &]) ~ k2(b — a), as b — a — ^ oo, 

where 

r° 

k 2 = ki + 2 I cov(0(-V(6) + 6), 0(V(O)) db « 1.029, 

J — oo 



and is defined by (2.16). The value of the constant k 2 was determined by simulating the vertex 
process directly in the way described above, using Theorem 2.4. by 10 4 runs on the interval [0, 10 4 ]. 



For the central limit result, we also need the following lemma. 
Lemma 2.7 The process V(a) : a £ M} is strongly mixing with strong mixing function 

a(d) = cexp { — ^d 3 } , 
for a constant c > 0. More specifically, for arbitrary a G R we have: 

sup |P(>4 n B) - P(>4)P(5)| < cexp {-^d 3 } , 
for all A £ <r{V{b) : b < a} and B £ a{V(b) :b>a + d}. 
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Proof. The proof proceeds along similar lines as the proof of Theorem 3.3 in [8]. Consider, for 
cti, . . . , Ofe < a and hi > . . . b\ > a + d, the events 

Ei = {V{ ai ) Gii..., V{a k ) G A k } , E 1 = {V(h) G B 1 . . . , V(b e ) G B e } , 

for Borel sets A%, . . . , j4&; and Si, ... , S^. Define 

M = \d, V M (b) = argmax| t _ 6 |< M {W(t) - (t - 6) 2 } , 

and consider the events 

E' 1 = E 1 n {V{a) = V M {a)} , E' 2 = E 2 n {V(a + d) = V M (a + d)} . 

By monotonicity, the event E[ only depends on the increments of Brownian motion before time 
a + M, and the event E' 2 only depends on the increments of Brownian motion after time a + d — M. 
By the definition of M and the independent increments property of Brownian motion, this implies 
that the events E[ and E' 2 are independent, and hence 

P (E[ n E 2 ) = F(E[)¥(E 2 ). (2.27) 

Furthermore, by Corollary 3.4 of [6] we get: 

F{E[ / Ei} < P{T/(a) ^ F M (a)} < 2P{F(a) > a + M} 

28/3 /«oo , . 

= 2P{F(0) > M\ ~ —7- — - / Itlexp^ -||i| 3 + 2 1/3 ai|t| } dt 
Ai (ai) Jm l j 

< A .,,_ . / t exp{-| t 3 } e"3 M <6e _ 3 M ,M^oo. 

" Ai'(ai) J M 11 P1 3 11 ; MAi'(ai) 

where, as before, Ai is the Airy function Ai and d\ its largest zero on the negative halfline. The 
probability P{i?2 ^ E' 2 } can be handled in a similar way. 
Hence we get: 

\¥{E 1 r\E 2 ) -F(Ei)¥(E 2 )\ < P {E[ ^ £ x } + P {E 2 ^ E 2 } < ce~l M \ 
for a constant c > 0, and the result follows. □ 

The following lemma shows that all moments of N[a, b] exist, for all b > a. 
Lemma 2.8 We have: 

Ee lN[a,b) < ^ 

for all I > and all b > a. 

Proof. By the stationarity it is sufficient to prove this for N[0, a]. For I > 0, the process 



a 1 — ^ 



exp llN[0, a] - \e l - l} </>(V(b) - b) db\ , a > 0, 



is a martingale w.r.t. the filtration {T a : a > 0}, where 

T a = <r{V(b), b G [0,a]}, a > 0. 
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So we find 



£exp jziV[0, a] - £ \e l - l} <f>(V(b) - b) db + 4a \e l - l} F(0) 2 | = SeM^- 1 



}v(o) 2 



for each a > 0. Moreover, since, according to part (iii) of Lemma 2.5 

~ 2t 2 , t -)■ -oo, ~ i ->■ oo, 

we have that 

4a \e l - l} V{0) 2 -J \e l - l} <f>(V(b) - b) db 
= |e' - l} {4V{0) 2 - <t>(V(b) - b)} db 

is bounded below, say by —M, where M > 0, using V(0) < V(b), for b > 0. Hence: 

£exp{/iV[0,a]} < Eexp |m + 4a |e z - lj F(0) 2 | < oo, 
for all positive a and Z, since, by part (iii) of Corollary 3.4 of [6], 



fv(o){x)dx < oo, 



for all a > 0. 



□ 



We are now ready to prove our main result. 



Proof of Theorem 1.3. By stationarity, we only have to prove the result for the interval [0, n] 
We have: 



N[0,n] = jr t N[k-l i k] i 



fc=i 



where the N[k — 1, k], k = 1, 2, . . . form a stationary sequence. By Lemma 2.8, all moments of 



iV[0, 1] exist. This fact, together with the mixing condition of Lemma 2.7 imply the result, using, 
e.g., Theorem 18.5.3 of [12]. □ 



3 The jumps of the Grenander estimator 

As an application of the results in section [2] we now discuss the use of these results in deriving the 
asymptotic normality of the number of jumps of the Grenander estimator f n of a strictly decreasing 
density /o on [0, M], M > 0. Note that the number of jumps of the Grenander estimator is the same 
as the number of segments of the least concave majorant of the empirical distribution function, 
which, in turn, is the same as the number of vertices of the least concave majorant minus one. To 
keep the length of the present paper within reasonable bounds, we only give a sketch of the proof. 
Full details will be given elsewhere. 
We have the following result. 
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Lemma 3.1 Let N n the number of jumps of f n , where f n is the Grenander estimator, based on a 
sample of size n from fo, where fo is a decreasing continuous density which stays away from zero 
on its support [0, M], with a continuous derivative f Q , which also stays away from zero on [0, M], 
where one-sided derivatives are taken at the endpoints. Then: 

pM 

EN n ~ km 1 ' 3 / |/^(x) 2 /(4/o(x))| 1/3 dx, n -> oo, 



and ^ 

:{N n ) ~ k 2 n 1 / 3 [ |/ (x) 2 /(4/o(x))| 1/3 dx, n ^ oo, 



var 



where the constants k± and k2 are defined as in Theorem 1.3, that is: 

h Pa 2.10848 and k 2 » 1.029. 
Sketch of proof. Note that, as in [JJ, p. 542, we can introduce locally a process V n , defined by 
V n (s) = C2n 1 / 3 sup ju : ¥ n (t + u) — ¥ n (t) — (a + n~ 1 / 3 cis)u is maximal j , 

where 

Cl = 2{ia|/^(« 70 (a))|} 1/3 = {4a|/^(< 70 (a))|} 1/3 and c 2 = | > 

and 50 is the inverse of /o- Here F n is the empirical df and a = fo(t) for an interior point t of the 
support of fo. The process V n is the (local) inverse of the slope process. As noted in [3], the process 
V n converges in distribution in the Skorohod topology to the process V, where V is the process of 
locations of maxima, discussed in section [2] (where c\ has an extra factor 2, to obtain V(s) instead 
of V{\s) in the limit). 

The jumps of the limiting process are a stationary locally finite point process, implying that 
the number of jumps of the process V n on an interval [b, c] converges in distribution to the number 
of jumps of V on the same interval. Hence, defining 

Vn(s) = V n ( C ^s), 

we get that the number of jumps of V n on an interval [b, c] converges in distribution to the number 
of jumps of V on an interval of length c|f 1 (c — b). 
Now note that 

V n (s) = C2n 1 / 3 sup ju : ¥ n (t + u) — ¥ n (t) — (a + n~ 1 ^ 3 s)u is maximal j , 

which has, on an interval [b, c], the same number of jumps as the process U n , defined by 

U n (a) = sup{x > : ¥ n (x) — ax is maximal} , a > 0. 

on the interval [a + 6n _1//3 , a + cn -1 / 3 ]. 

We can strengthen this argument somewhat (full details will be given elsewhere), using a Pois- 
sonization argument together with a strong approximation result of [14], to show that the expec- 
tation and variance of the number of jumps of U n on an interval 



\a — n 1 / 3 log n, a + n loe 



n 
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are of order 

2fcic 1 ~ 1 n 1 / 3 logn and 2/c2cJ~ 1 n 1 / 3 logn, 



respectively, where k± and are denned as in Theorem 1.3 



Partitioning the interval [/o(M), /q(0)] into K n intervals of length of order 2n 1 / 3 logn, with 
midpoints dj, we get: 

K n 

EN n ~ fc 1 n 1 / 3 ^{4a i |^( 50 (a,))|}" 1/3 2n- 1 / 3 logn 
i=i 

/■/o(O) rM 
~ fcm 1 / 3 / {4a|/^( ff0 (a))|}- 1/3 da = fcm 1 ^ / { 4 / (x)|/ (x)|}- 1/3 

J/o(A/) JO 

A similar argument, again using Riemann sums approximating the corresponding integral, gives 
the result for the variance. □ 



The conditions in Lemma |3.1| are probably stronger than needed, and we give two examples 
below which may also satisfy the result, but do not satisfy the conditions of the lemma. In the 
first example /q does not stay away from zero on [0, M], and in the second example /o has infinite 
support. 

If f Q ( x ) = 2(1 - x) on [0, 1], we get: 

f 1 |/ (x) 2 /(4/o(x))| 1/3 dx = 3 • 4- 2 / 3 « 1.19055, 
J o 

and hence: 

EN n ~ 2.51 n 1/3 , var(iV n ) ~ 1.225 n 1/3 , n -> oo. 

A simulation of 1000 samples with n = 1000 gave as mean number of the number of jumps N n ^, 
k = 1,...,1000, 

v iooo M 
ft Lk=l "n,k =2-5Q26 1/3 

1000 

and as variance 2 

Efc=T (^.fc ~ N n ) = 238n l/ 3 

999 

If /o is the standard exponential density, we get: 



/ 1 |/o(^) 2 /(4/o(x))| 1/3 ^ = 3-2- 2 / 3 «l. 

JO 



'0 

and hence: 

EN n ~ 3.98477 n 1/3 , var(iV n ) ~ 1.945 n 1/3 , n -> oo, 
whereas 1000 samples of size n = 1000 yielded: 

Y^iooo m 
1000 
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and as variance 



Eg? (N n , 



999 



1.86570 ra 1/3 . 



Analogously to the result for Brownian motion, we have: 

Theorem 3.1 Let N n the number of jumps of f n , where f n is the Grenander estimator, based 
on a sample of size n from /o, where fo satisfies the conditions of Lemma 3.1. Moreover, let the 
constants k\ and k2 be defined as in Theorem 1.3 Then: 

-M 



n 



V6 \ N n - hn 1 / 3 I {/^(x) 2 /(4/o(x))| 1/3 dx 



o 



V 



N(0,a 2 



where N(0,a 2 ) is a normal distribution with expectation zero and variance 



|/^(x) 2 /(4/o(x))| 1/3 dx 



Although the proof proceeds along similar lines as the proof of Theorem 1.3 in section [2j the 
embedding into Brownian motion needs some careful attention, and therefore the details of the 
proof will be given elsewhere. 



4 Concluding remarks 

In the preceding, a central limit result was proved for the number of vertices in an increasing interval 
of the concave majorant of the process {W(t) — t 2 , t G R}, where W is two-sided standard Brownian 
motion, originating from zero. The central limit result involves two constants k\ and £;2 for the 



mean and variance, respectively, see Theorem 1.3 The constant k\ has several representations, for 
example 

k-y = 8£F(0) 2 = §£max (W(t) - t 2 ) » 2.10848, 



see (2.19), where a h-> V(a) is the process of locations of maxima of W(t) — (t — a) , as a function 



of a. From [15] we get: 

h = 2EX(0) + ^'(O) 2 = 2EX(0) + AEV{0) 2 

where X is the concave majorant of the process {W(t) - t 2 , t G R}. This implies as a side result 
the relation 

EX(0) = lE max {W(t) - t 2 } , 



see Remark 12.71 

Much less is known about the constant k2- We used a direct simulation of the vertex process 
to determine this constant, but there is room for improvement here. The approximate value we 
found is close to 1, and our preliminary value is: k2 = 1.029. The basis for the simulation of the 



vertex process directly, without first generating Brownian motion, is given in Theorem 2.4, which 
gives the distribution of the (non-exponential) waiting times between jumps of the process V, as a 
function of a and x, where V(a) = x, and the density of the size of the jumps. The square roots of 
the jump lengths were generated by rejection sampling, and the waiting times between jumps by 
generating standard exponential random variables, and by applying the inverse of the cumulative 
hazard function of the waiting time distribution (again parametrized by x and a) on these. 
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A similar technique of generating vertices of a convex hull was used in |16j and [7] , where convex 
hulls of random points in the plane were studied. The behavior of the least concave majorant of 
Brownian motion minus a parabola has some remarkable analogies with the behavior of the convex 
hulls of points chosen uniformly from the interior of a circle, where we also get central limit theorems 
for the number of vertices, with an expectation and variance which are also both of order n 1//3 , 
if n is the number of points chosen. On the other hand, the behavior of the concave majorant of 
one-sided Brownian motion without drift has analogies with the behavior of the convex hulls of 
points drawn uniformly from a convex polygon, where we get central limit theorems for the number 
of vertices with an asymptotic expectation and variance which are both of order log n, if n is the 
number of points chosen, as shown in [5]. 

Acknowledgements I am grateful to the referees for their careful reading and useful comments. 
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